The Need of Standardised Metadata to Encode Causal Relationships: Towards Safer Data-Driven Machine Learning Biological Solutions

نویسندگان

چکیده

In this paper, we discuss the importance of considering causal relations in development machine learning solutions to prevent factors hampering robustness and generalisation capacity models, such as induced biases. This issue often arises when algorithm decision is affected by confounding factors. work, argue that integration research assumptions relationships can help identify potential confounders. Together with metadata information, it enable meta-comparison data acquisition pipelines. We call for standardised meta-information practices a crucial step proper development, validation, sharing. Such include detailing process, aiming automatic actionable metadata.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

investigating the effect of motivation and attitude towards learning english, learning style preferences and gender on iranian efl learners proficiency

تحقیق حاضر به منظور بررسی تاثیر انگیزه و نگرش نسبت به یادگیری زبان انگلیسی، ترجیحات سبک یادگیری و جنسیت بر بسندگی فراگیران ایرانی زبان انگلیسی انجام شد. برای این منظور، 154 فراگیر ایرانی زبان انگلیسی در این تحقیق شرکت کردند. سه ابزار جمع آوری داده ها شامل آزمون تعیین سطح بسندگی زبان انگلیسی آکسفورد، پرسشنامه ترجیحات سبک یادگیری براچ و پرسشنامه انگیزه و نگرش نسبت به یادگیری زبان انگلیسی به م...

Principles of metadata organization at the ENCODE data coordination center

The Encyclopedia of DNA Elements (ENCODE) Data Coordinating Center (DCC) is responsible for organizing, describing and providing access to the diverse data generated by the ENCODE project. The description of these data, known as metadata, includes the biological sample used as input, the protocols and assays performed on these samples, the data files generated from the results and the computati...

متن کامل

Towards a Metadata-driven Multi-Community Research Data Management Service

Nowadays, the daily work of many research communities is characterized by an increasing amount and complexity of data. This makes it increasingly difficult to manage, access and utilize to ultimately gain scientific insights based on it. At the same time, domain scientists want to focus on their science instead of IT. The solution is research data management in order to store data in a structur...

متن کامل

Learning Causal Relationships

How ought we learn causal relationships? While Popper advocated a hypothetico-deductive logic of causal discovery, inductive accounts are currently in vogue. Many inductive approaches depend on the causal Markov condition as a fundamental assumption. This condition, I maintain, is not universally valid, though it is justifiable as a default assumption. In which case the results of the inductive...

متن کامل

Metadata Driven Data Transformation

The bottleneck of a data warehouse implementation is the ETL (extraction, transformation, and load) process, which carries out the initial population of the data warehouse and its further (usually periodical) updates. There is a number of software products supporting the OLAP analysis. However, the ETL process implementation is not repeatable in a significant way. This paper reports on a resear...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Lecture Notes in Computer Science

سال: 2022

ISSN: ['1611-3349', '0302-9743']

DOI: https://doi.org/10.1007/978-3-031-20837-9_16